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This article is about temporal multi-agent logics. Several of these formalisms have been already 
presented (ATL-ATL*, ATL vo SL). They enable to express the capabilities of agents in a system to 
ensure the satisfaction of temporal properties. Particularly, SL and ATL SC enable several agents to 
interact in a context mixing the different strategies they play in a semantical game. We generalize 
this possibility by proposing a new formalism, Updating Strategy Logic (USL). In USL, an agent can 
also refine its own strategy. The gain in expressive power rises the notion of sustainable capabilities 
for agents. 

USL is built from SL. It mainly brings to SL the two following modifications: semantically, the 
successor of a given state is not uniquely determined by the data of one choice from each agent. 
Syntactically, we introduce in the language an operator, called an unbinder, which explicitly deletes 
the binding of a strategy to an agent. We show that USL is strictly more expressive than SL. 

1 Introduction 

Multi-agent logics are receiving growing interest in contemporary research. Since the seminal work of 
Rajeev Alur, Thomas A. Henzinger, and Orna Kupferman [2], one major and recent direction (ATL with 
Strategy Context E]|6]|7l, Strategy Logic (presented first in and then extended in dHlOl) aims at 
contextualizing the statements of capabilities of agents. 

Basically, multi-agent logics enable assertions about the capability of agents to ensure temporal 
properties. Thus, ATL-ATL* [2] appears as a generalization of CTL-CTL*, in which the path quantifiers 
E and A are replaced by strategy quantifiers. Strategy quantifiers (the existential ((A)) and the universal 
|AJ) have a (coalition of) agent(s) as parameter. ((A)) (p means that agents in A can act so as to ensure the 
satisfaction of temporal formula (p. It is interpreted in Concurrent Game Structures (CGS), where agents 
can make choices influencing the execution in the system. Formula ((A)) q> is true if agents in A have a 
strategy so that if playing it they force the execution to satisfy <p, whatever the other agents do. 

A natural question is: how to interpret the imbrication of several quantifiers? Precisely, in the inter- 
pretation of such formula as 

y/i := «ai)}D(<piA((a 2 ))n<p2) 

(where Ocp is the temporal operator meaning cp is always true, and a\ and a 2 are agents), is the evaluation 
of <p2 made in a context that takes into account both the strategy quantified in ((ai)) and the strategy 
quantified in ((02))? 

In ATL-ATL*, only a 2 is bound: subformula ((a 2 ))n<?>2 is true iff a 2 may ensure D(p2, whatever the 
other agents do. Then ((a 2 )) stands for three successive operations: First, each agent is unbound from its 
current strategy, then an existential quantification is made for strategy a. At last, a 2 is bound to strategy 
a. 

ATL SC (3l|6l|7l, while keeping the ATL syntax, adapts the semantics in order to interpret formulas in 
a context which stores strategies introduced by earlier quantifiers. 
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Strategy Logic (SL (8j[T0|]) is another interesting proposition, which distinguishes between the quan- 
tifications over strategies and their bindings to agents. The operator ((a)) is split into two different 
operators: a quantifier over strategies (((x)), where x is a strategy variable) and a binder ((a,x), where a 
is an agent) that stores into a context the information that a plays along the strategy instantiating variable 
x (let us write it o x in the remaining of this paper).The ATL formula \jf\ syntactically matches the SL: 

¥2 ■= ((xi})0i,*ipOi A((x 2 ))(a 2 ,x 2 )n<p2) 

In Y2, when evaluating D(p2, a\ remains bound to strategy a Xl except if a\ and a 2 are the same agent. If 
they are the same, the binder (a2,x 2 ) unbinds a from its current strategies before binding her to a X2 . 

In this paper we present USL, a logic obtained from SL by making explicit the unbinding of strategies 
and allowing new bindings without previous unbinding. For that, we introduce an explicit unbinder 
(a tf> x) in the syntax (and the binder in USL is written {a Ox)) and we interpret USL in models where 
the choices of agents are represented by the set of potential successors they enable from the current state. 
When there is no occurrence of an unbinder, each agent remains bound to her current strategies. Then 
different strategies can combine together even for a single agent, provided that they are coherent, which 
means they define choices in non-empty intersection (the notion is formally defined in Sect.|2]). 

The main interest in such introduction is to distinguish between cases where an agent composes 
strategies together and situations where she revokes a current strategy for playing an other one. If a\ and 
a 2 are the same agents, then \\r 2 is written in SL: 

Y3 ■= ((xi))(a,xi)D(<pi A ((x 2 ))(a,x 2 p<P2), 

which syntactically matches the USL: 

V4 := ((xi))(a >xi)D(<pi A((x 2 ))(a >x 2 )n<jj2) 

In 1//3, subformula ((x 2 ))(a,JC2)n<P2 states that a can adopt a new strategy that ensures D(f>2, no matter 
if it is coherent with the strategy o Xl previously adopted. In 1//4, both strategies must combine coherently 
together. In natural language 1//4 states that a can ensure q>\ and leave open the possibility to ensure (fa 
in addition. The equivalent of 1//3 in USL is actually not 1//4 but 

V5 := ((xi))(a>xi)D(<pi A((x 2 ))(a^xi)(a>x 2 )D<P2) 

There indeed, in subformula (a tf> xi)(a \> x 2 )D<j>2, a is first unbound from o X{ and then bound to o X2 . 

A consequence of considering these compositions of strategies is the expressiveness of sustainable 
capabilities of agents. Let us now consider the USL formula: 

¥6 ■= >x l )n({(x 2 ))(a $>xi)(a >x 2 )X p) 

There the binder (a \> x 2 ) is used with the unbinder (a tf> x\ ), so that % is equivalent to the SL: 

¥1 ■= ((xi))(a,xi)D(((x 2 )}(a,x 2 )X p) 

It states that a can remain capable to perform the condition expressed by X p when she wants. But in 
case she actually performs it, the formula satisfaction does not require that she is still capable to perform 
it. The statement holds in state sq in structure ^\ with single agent a. See Fig{T] where choices are 
defined by the set of transitions they enable. Since j%\ interprets SL formulas with only agent a, the 
choices for a are deterministic: let s,s' be two states and c a choice, then the transition from s to s' is 
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Figure 1: Structure jjt\ 



labelled with c iff {s'} is a choice for a at s. Indeed, by always playing choice c\,a remains in state sq, 
where she can change her mind to ensure p. But if she chooses to reach p, she can do it only by moving 
to state s\ and then to state S2- Doing so, she loses her capability to ensure X p at any time. The only 
way for her to maintain her capability to reach p is to always avoid it, her capability is not sustainable. 

A more game theoretical view is to consider strategies as commitments. In such view, by adopting 
a strategy, a adopts a behavior that holds in the following execution, as far as it is not explicitly deleted. 
Formula 

Y& ■= {(xi})(a>x 1 )D(((x 2 )}(a>X2)Xp) 

is the counterpart of formula Xjfj with such interpretation of composing strategies for a single agent. If 
a plays a X2 , it must be coherently with a Xl . Thus, y% is false in structure dt\, since a cannot achieve p 
more than once. 

Formula distinguishes between structures and ^#2 from Figf2] ( Note that in this second 
structure the choices are not deterministic: from a given state a choice may be compatible with several 
potential successors). In ^Mi, y/g is true at s since the strategy always play c\ ensure the execution to 
remain in state so or s\ and is always coherent with strategy play C2 first and then always play c\, which 
ensures X p from states s and s\. What is at stake with it is the difference between sustainable capa- 
bilities and one shot capabilities. Formulas 1//7 and both formalize the natural language proposition a 
can always achieve p. One shot capability (1//7) means she can achieve it once for all and choose when. 
Sustainable capability (y/g) means she can achieve it and choose when without affecting nor losing this 
capability for the future. 




ci,c 2 



Figure 2: Structure ^2 



In SectfJJ we compare the expressive power of SL and USL by use of formula 1//9, obtained from 1//7 
by adding to a the sustainable capability to ensure X ->p: 



y 9 := {(x))(a > x)D(({x ))(a > x )X p A {(x Q ))(a > x )X ->p) 



1//9 states that a has sustainable capability to decide whether p or ->p holds at next state. We say that a 
has sustainable control on property p: she is sustainably capable to decide the truth value of p. 

The main purposes of USL are to give a formalism for the composition of strategies and to unify 
it with the classical branching-time mechanisms of strategy revocation. So, both treatments can be 
combined in a single formalism. In the remaining of this paper we define USL syntax and semantics, and 
we introduce the comparison of its expressive power with that of SL. 
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2 Syntax and semantics 

In this section we present the syntax and semantics of USL, together with the related definitions they 
require. The USL formulas distinguish between path and state formulas. 

Definition 1. Let Ag be a set of agents, At a set of propositions andX a set of variables, USL (Ag,At,X) 
is given by the following grammar: 

• State formulas: (p ::= p | -><p | (p A <p | ((x))<p \ (A > x)\j/ \ (A g£ x)\j/ 

• Path formulas: \j/ ::= (p \ \ y A y/" | y U y | X y 

where p G At, A C Ag,x G X. 

These formulas hold a notion of free variable that is similar to that in [EJEEO] : an atom has an empty 
set of free variables, a binder adds a free variable to the set of free variables of its direct subformula 
and a quantifier deletes it. Upon formulas on this grammar, those that can be evaluated with no context 
are the sentences. They are formulas with empty set of free variables, which means each of their bound 
variables is previously quantified. We now come to the definitions for USL semantics. 

Definition 2. A Non-deterministic Alternating Transition System (NATS) is a tuple 
Ji = (Ag,M,At,v,Ch) where: 

• M is a set of states, called the domain of the NATS, At is the set of atomic propositions and v is a 
valuation function, from M to &(At). 

• Ch: AgxM — > 3^(3^(M)) is a choice function mapping a pair (agent, state) to a non-empty family 
of choices of possible next states. It is such that for every state s G M and for every agents a\ and 
a2 in Ag, for every c\ G Ch(a\,s) and C2 G Ch(a2,s),c\ (lC2 ^ 0- 

We call a finite sequence of states in M a track T. The last element of a track z is denoted by last(z). 
The set of tracks that are possible in is denoted by track : T = sqs^ . . . Sk G tracks iff for every i < k, 
for every a G Ag, there is c a G 3?(M) s.t. c a G Ch(a,s{) and G c a . Similarly, an infinite sequence of 
states such that all its prefixes are in track £ is called a path (in ^#). 

Definition 3 (Strategies and coherence). A strategy is a function ofromAg x track j( to &(M) such that 
for all (a, t) G Ag x track jg, a(a, t) G Ch(a,last(x)). By extension, we write a(A, %)for f] a€A o(a, t)for 
every A C Ag. Two strategies 0\ and 02 are coherent iff for all (a, t) in Ag x tracks, <J\ (a, t) n 02(a, t) 7^ 
0. In this case, we also say that Oi (a, t) and 02(0, z) are coherent choices. 

A commitment k is a finite sequence upon (0P(Ag) x X), representing the active bindings. An assign- 
ment a is a partial function from X to Strat. A context % is a pair of an assignment and a commitment. 
Note that an agent can appear several times in a commitment. Furthermore commitments store the or- 
der in which pairs (A,x) are introduced. Therefore our notion of contexts differs from the notion of 
assignments that is used in SL |[8l[T0ll. 

A context defines a function from tracks to &(M). We use the same notation for the context itself 
and its induced function. Let Kg be the empty sequence upon (&(Ag) x X), then: 

• {a,K Sb ){x)=M 

• (a,(A,x))(r) = 

- PUea t) if A ^0 

- elseM 



• (a,K-(A,x))(z) = 
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- (a, k)(x) n (a, (A,x))(x) if this intersection is not empty. 

- otherwise (which means the context induces contradictory choices), (a, Jc)(t) . 

Now we can define the outcomes of a context %, out {%): let % = 71q, 7t\, . . . be an infinite sequence over 
M, then % G out(s,%) iff % is a path in s = Tto and for every n G N, 7T n+ i G #(7To • • • ^«)- 

Definition 4 (Strategy and assignment translation). Let o be a strategy and X be a track. Then a T is the 
strategy s.t. for every x' G tracks , ct t (t') = o{xx'). The notion is extended to an assignment: for every 
a, a T is the assignment with domain equal to that of a and s.t. for every x G dom(a), (X z (x) = (a(x)) T 

We also define the following transformations of commitments and assignments. Given a commitment 
K, coalitions A and B, a strategy variable x, an assignment a and a strategy a : 

• k[A ->x] = k-(A>x) 

• ((B,x) ■ k)[A ^x) = (B\A,x) ■ (k[A-»x\) and K«[A -»x] = Kq 

• a[x — > a] is the assignment with domain dom(a) U {x} s.t. Vy G <iom(a)\{x}, a[x — ^ o](y) = a{y) 
and a [x — > a] (x) = a 

Definition 5 (Satisfaction relation). Let ^# be a NATS, then for every assignment a, commitment K, 
state s and path K: 

• State formulas: 

- ^#,a, K,s\= piffpE v(s), with p eAt 

- J%,a, K,s\= -i(p iff it is not true that ^#,a, K,s \= <p 

- J%,a, K,s \= <pi A (fe iff^,a, k,s \= <pi and Jt K,s \= q>2 

- a,K,s \= ((x))(p iff there is a strategy O G Strat s.t. a[x — > a], K,s \= (p 

- ^#,a, k,s \= (A D> x)(p iff for every % in out(a, k[A -tx]),^,a, k[A -tx],n\=q> 

- Jl,a, K,s\=(Aifix)(p iff for all n in out(a, k[A -»x]),^/?,a, k[A -»x],n\=<p 

• Path formulas : 

- a, K, % |= (p iff ' *rft ', a, fC, 7To |= <p, for every state formula (p 

- a,K,n \= -iyr iff it is not true that a, K,n \=\ff 

- Jt, a, k, % \= Yi A ^2 #T^, cc,K,7i \= and Jt, cc,k,k\= y/2 

- ^#,a, K,7t\=X\j/ iff ' Jt \a** ,K,n l \= y. 

- Jt, a,K,7t\=Yi U Y2 iff there is i G N s.t. Jl , a 1T(> - 7Ci - [ , K, n' \= y 2 and for every < j < 

Let a® be the unique assignment with empty domain. Let (p be a sentence in USL (Ag,At,X). Then 
Jt,s |= (p iff^,aQ,Kd, \= (p. 

Let us give the following comment over these definitions: for every context % = (a, k), the definition 
of out (x) ensures that the different binders encoded in % compose their choices together, as far as 
possible. In case two contradictory choices from an agent are encoded in the context, the priority is 
given to the first binding that was introduced in this context (the left most binding in the formula). This 
guarantees that a formula requiring the composition of two contradictory strategies is false. For example, 
suppose that ((x\))(a \> x\)(p\ and (fa)} (a t> X2)(p2 are both true in a state of a model, and suppose that 
strategies a Xl and a X2 necessarily rely on contradictory choices of a (this means that a cannot play in a 
way that ensures both (pi and (p2). Then, ((xi))(a t> *i)(<pi A (fa)) (a > X2)(p2) is false in the same state of 
the same model. If the priority was given to the most recent binding (right most binding in the formula), 
the strategy a Xl would be revoked and the formula would be satisfied. 
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3 Comparison with SL IH 

SL syntax can be basically described from SL by deleting the use of the unbinder. Furthermore, the 
binders are limited to sole agents and are written (a,x) instead of (a > x). USL appears to be more 
expressive than SL [EJQJ3. More precisely, SL can be embedded in USL, while 1//9 is not expressible 
in SL, even by extending its semantics to non-deterministic models. Here we give the three related 
propositions. By lack of space, the proofs are only sketched in this article. Detailed proofs of these 
propositions can be found in H. Note that, since SL is sttictly more expressive than ATL SC (6l, the 
following results also hold for comparing USL with ATL SC . 

Proposition 1. There is an embedding ofSL into USL. 

Proof ( Sketch ). The embedding consists in a parallel transformation from SL models and formulas to that 
of USL. The transformation preserves the satisfaction relation. The differences between SL and USL 
lie both in the definition of strategies in SL semantics and the difference of interpretation for the binding 
operator. The first is treated by defining an internal transformation for SL. By this transformation, the 
constraints of agents playing the same choices, issued from SL actions framework, are expressed in the 
syntax. Then we define a new operator in USL that is equivalent to SL binding, and show the equivalence: 
the operator [a > x] is an abbreviation for a binder (a > x) preceded by the set of unbinders (a tf> a,), one 
for every variable x, in the language. □ 

Proposition 2. A model is said deterministic if the successor of a state is uniquely determined by one 
choice for every agent. Then, sustainable control is not expressible over deterministic models, neither in 
SL nor in USL. 

Proof (Sketch). One checks that for every deterministic NATS for any state s of jft, ^^sY 1 y/9. 
Proposition Q] then straightly brings proposition |2] □ 

Proposition 3. Sustainable control is not expressible in SL interpreted over NATSs. 

Proof (Sketch). The proof uses a generalization of SL semantics over NATSs. Its definition is in H and 
holds, for example, the following cases: 

• ^,a, k,s ^nats X <p iff for every % G out(s, (a, a^, k, K\ |=nats <P 

• Jl,a, K,n ^nats <Pi U (f>2 iff for every % G out(s, (a, k)), there is i G N s.t. ^,a n °- ni - 1 ,K,7i i 
Nnats (jh and for all < j < i..//. a^ - ^-i , k, k j H^ats <Pi- 

• ^C, a,K,s ^nats ((■*}) <P iff t nere is a strategy a G Strat s.t. ,a[x —¥ o],K,s |=nats <P- 

• J?,a,K,s H-jats (a,x)cp iff JK,a,K[x\K(a)],s \=nats 9- 

where K'[A\K'(a)] designates the context obtained from K by replacing every (a,y) in it by (a, x). 

Formula 1//9 states that a can always control whether p or not. Suppose there is a formula <p in SL 
equivalent to 1//9 and let us call existential a formula in SL in which every occurrence of ((x)) is under an 
even number of quantifiers. If q> is existential then under binary trees it is equivalent to a formula in £} 
(the fragment of second order logic with only existential set quantifiers). 

We now consider a set of formulas {r,}, e N, each one stating that a can choose i times between p and 
->p. The set {r,-}^ is defined by induction over i: 

• r := ({x))(a,x)D(({xo))(a,x )X p A ({x ))(a,x )X -1/7) 



Ch. Chareton, J. Brunei & D. Chemouil 



97 



• for all i G N,n+ 1 = r t \p A n(((x i+1 )) (a,x i+1 )X p A ({x i+ i))(a,x i+1 )X ^p\p] 
[->p A □( {(x i+ 1 )) (a , x i+ 1 )X p A ((*/+ 1 )) (a , x ; -+ 1 )X ->p)\~>p] ■ 

where the notation 0i[02\03] designates the formula obtained from 6\ by replacing any occurrence of 
subformula 63 in it by 62- {Ti}^ is equivalent to (p. A compactness argument shows that it is not 
equivalent to a formula in £} under binary trees, hence 9 is not an existential formula. Then, we notice 
that (p is true in structures where, from any state, a can ensure any labelling of sequences over p. So, if 
(p has a subformula (a,x)\j/ where x is universally quantified, y must be equivalent to □(/? V -</?). Then, 
by iteration, <p is equivalent to an existential formula in SL. Hence a contradiction. □ 

4 Conclusion 

In this article we defined a strategy logic with updatable strategies. By updating a strategy, agents remain 
playing along it but add further precision to their choices. This mechanism enables to express such 
properties as sustainable capability and sustainable control. To the best of our knowledge, this is the first 
proposition for expressing such properties. Especially, the comparison introduced with SL in this article 
could be adapted to ATL with Strategy Context 

The revocation of strategies is also questioned in JT]. The authors propose a formalism with definitive 
strategies, that completely determine the behaviour of agents. They also underline the difference between 
these strategies and revocable strategies in the classical sense. We believe that updatable strategies offer 
a synthesis between both views: updatable strategies can be modified without being revoked. 

Strategies in USL can also be explicitly revoked. This idea is already present in [3 1 with the operator 
•)A(\ But the operator (-A-) also implicitly unbinds current strategy for agents in A before binding them 
a new strategy. Thus it prevents agents from updating their strategy or composing several strategies. 

Further study perspectives about USL mainly concern the model checking. Further work will provide 
it with a proof of non elementary decidability, adapted from the proof in [10]. We are also working 
on a semantics for USL under memory-less strategies and PSPACE algorithm for its model-checking. 
Satisfiability problem should also be addressed. Since SL SAT problem is not decidable, similar result is 
expectable for USL. Nevertheless, decidable fragments of USL may be studied in the future, in particular 
by following the directions given in ll9l . 
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